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Abstract. A formula expressing cumulants in terms of iterated integrals of the distribu- 
tion function is derived. It generalizes results of Jones and Balakrishnan who computed 
expressions for cumulants up to order 4. 
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1. Introduction 

The expectation of a random variable can be computed in many ways. One method 
involving only the distribution function is obtained by careful partial integration and looks 
as follows: 



Oh ■ poo i>o 

^ EX= (1-F(t))dt- / F(t)dt- 

JO J-oo 

a similar formula holds for the variance, which can be written as the following double 
integral: 



(1.1) Var(X)=2 J J F(ti)(l - F(t 2 )) dt x dt 2 . 

. -00<tl<t2<00 

Analogues of these formulas expressing the third an fourth cumulants (skewness and kur- 
tosis) in terms of iterated integrals of the distribution function were computed some time 
ago by Jones and Balakrishnan |7J. The proof relied on ad hoc partial integration, see also 
' [2], where similar formulas for mean differences are considered. 

The aim of the present note is a generalization of these expressions to cumulants of 
arbitrary order, resulting in a formula resembling the well-known Mobius inversion formula, 
which expresses cumulants in terms of moments. 

The paper is organized as follows. After a short introduction to cumulants in Section [2] 
and a review of partitions and shuffles in Section [3] we give two proofs of the main result. 
The first one using an elementary identity for the Volterra integral operator and Chen's 
shuffle formula for multiple integrals is contained in Section HI In the concluding Section [5] 
we give another proof based on a formula for multivariate cumulants due to Block and 
Fang. 

2. Moments and cumulants 

Let X be a random variable with distribution function F(x) = P(X < x) — f* dF(t). 
Its moments are the numbers 

/oo 
r dF(t) 
-oo 
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Under some assumptions the sequence of moments contains the complete information 
about the distribution of X. It can be collected in the exponential moment generating 
function (formal Fourier-Laplace transform) 

^ W = Ee- = f;^» = l + ^ + ^ + ... 

n=0 

and the Taylor coefficients of the formal logarithm of the m.g.f. 

oo 

K r . 



iog^(*)=X3* B 



nl 

n=l 

are called the cumulants. The first two are expectation and variance 

K\ = mi = EX 
k 2 = m 2 — ml = Var X 

and after rescaling the following two are the skewness k^/k^J 2 and the kurtosis K4/ ' k\. 

The cumulants carry the same information as the moments but for many purposes in a 
better digestible form, e.g., after a translation the moments behave like 



(X + r) = f2( n h )r n ' k rn k (X) 

h— n \ / 



while the cumulants are 

s \t + Ki(X) 71 = 1 

(2.1) K n {X + r) = \ V 

(n n (X) n>2 

For this reason the cumulants are sometimes called the semi- invariants of X. The most 
important property of the cumulants is the identity 

K n (X + Y)=K n (X)+K n (Y) 

if X and Y are independent. 

3. Partitions and Shuffles 

There is also a combinatorial formula expressing the cumulants as polynomials of the 
moments. A set partition of order n is set 

7r = {B 1 , B 2 ,..., B p } 

of of disjoint subsets Bi C {1, 2, . . . , n}, called blocks, whose union is {1, 2, ... , n}. Denote 
n„ the set of all n-set partitions. It is a lattice under the refinement order 

7r < a ^=^> every block of 7T is contained in a block of a, 

with minimal element 6„ = {{1}, {2}, . . . , {n}} and maximal element 1„ = {{1, 2, . . . , n}}. 
Each set partition it g II n determines a number partition, called its type, \(ir) = l kl 2 k2 ■ ■ ■ n kn h 
n where kj is the number of blocks B G tt of size \B\ = j. For the combinatorial identities 
below we will employ the following conventions. For a partition A = l kl 2 k2 ■ ■ -n kn h n we 
abbreviate A! = l\ kl 2\ k2 ■ ■ -n\ kn and for a sequence (a n ) n6 N of numbers we denote 
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similarly for a set partition it we let 

a-ir = a\(n) = W a \ B \ 



BGtt 

Given a partition Ahn, the number of set partitions n E Tl n with \(n) = A is equal to the 
Faa di Bruno coefficient 

The well known moment-cumulant formula says 

(3.2) n n = ^ m K A i ( 7r ' 1») 

7rGn n 



where /i is the Mobius function on Il n . Its values only depend on X(ir) = l kl 2 k2 ■ ■ ■ n kn 
namely 

n 

/ i(vr,i„) = /iA W = n((- 1 ) J 0'- 1 ) ! )^ 

i=i 

Using the Faa Di Bruno coefficients (13. ip the moment-cumulant formula can be condensed 
to 

(3.3) K n = ^ < U [ m x fi X 



Ahn 



Definition 3.1. Let a = (a±, a 2 , ■ ■ ■ , a m ) and b = (b±, b 2 , . . . , b m ) be two finite sequences. 
A shuffle of a and b is a pair of order preserving injective maps (p : a — > {1, . . . , m + n} 
and ^ : 6 — ► {1, . . . , m + n} with disjoint images. When the two sequences are thought of 
as two decks of cards, this corresponds to putting the two decks together in such a way 
that the relative order in the individual decks is preserved, the card aj (resp. bj) being 
put in position </?(aj) (resp. ^{bj)). The result of the shuffle is the sequence, where each 
i is replaced by the symbol <£> -1 (i) (or ^ _1 (i)). Denote LU(m, n) the set of shuffles of the 
sequences (1,2,..., m) and (m + 1, m + 2, . . . , m + n). Shuffles of multiple sequences are 
defined accordingly and for a partition Ahn denote LU(A) the set of shuffles of disjoint 
sequences with cardinalities given by A. 

Each shuffle is uniquely determined by the subsets the individual sequences are mapped 
to and thus the number of shuffles is equal to the number of ways of picking these subsets, 
i.e., the multinomial coefficient 



#LU(A) 



On the other hand each shuffle r G LU(A) determines a partition 7r of {1, 2, . . . , n} of type 
A. However different shuffles may determine the same partition tt, if A contains entries of 
the same size. Therefore we have the identity 



#LU(A) = ki\k 2 \ ■ ■ - k ri 
which will lead to an interesting cancellation later. 
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4. Iterated Integrals 
Definition 4.1. The Volterra operator is the integral operator 

Vf(x)= I f(t)dt 

J — oo 

defined for suitable integrable functions / : R — > R; its powers are defined recursively by 

v n f(x)= r {v^fmdt 

J — oo 

It was first observed by Chen [I] and Ree [9] that the recursively defined iterated integrals 

«<i,.2,..,»^) = / oi il)h) ... )in _ 1 (u) da in {u) 



which can be written as 

doii^ti) da i2 (t 2 ) ■ ■ ■ da in (t r 



a<ti<t'2<-t n 

satisfy the shuffle relations 

a ii,i2,---,i m {ty a ji,j2,---,jn W ~~ ^ 1 Oiuit) 

a£UI({i 1 ,...,i m },{j 1 ,...,j n }) 

For the Volterra operator this means that for example 

r/(i) v n f{x) = Yl // " ' / /r(tl ' " ' ' tm+n) dtl dh '" dtm+n] 

T6m M ti<-<w» 

where f T (ti, . . . ,t m+n ) = /(^(i)) /(^(i)) witn V 9 and i> as in Definition ETTJ 

Note that if f(t) is a probability density function, then the corresponding distribution 
function is given by 

F(x) = Vf(x) 

We denote F^(x) = VF^(x) = V n+1 f(x) where F^(x) = F(x) = /f^ dF(t). This 
notation slightly differs from [2]. Then one can easily show by induction that these integrals 
are truncated moments. 



Lemma 4.2. 



FM( t ) = I f (r-trrfF(t) = i-E(r-X)I 
n! y_ 00 n! 



Before proceeding further, assume for the moment that the support of dF is bounded 
and that r is an upper bound. Then Y = X — r has moments 



y n = E(X - ,-)" = (-l)"«!f W (r) = ff) 



(-r) n fc m fc 



and because of (12. ip the cumulants are 

'k 1 (Y)+t n = l 



K n (Y) n>2 
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Thus we can express the cumulants of order n > 2 by a moment- cumulant type formula in 
terms of y n = (— l) n n\V n F: 

(4.1) K n (x) = y*n(*X) 

7ren n 

After some shuffling (in the literal sense!) this simplifies to the following formula. 
Theorem 4.3. 



(4.2) 



K n = (-l) n n\ //•■■/ *2, • • • : tn) M 71 "; W dti dt 2 ■ • • dt n 



t 1 <t 2 <--<tn 



where F w (t 1 ,t 2 , . . . ,t n ) = Ylne^ F(t a {B)) where a(B) denotes the first (i.e., smallest) ele- 
ment of a block B. 

Note that the higher order cumulants do not depend on r and thus the formula also 
holds if the support of X is unbounded. 

Remark 4.4. It may be hoped that this formula provides some insight to Rota's problem 
of the cumulants [TO], namely to find a collection of inequalities which are necessary and 
sufficient for a number sequence to be the cumulant sequence of some probability distri- 
bution. One advantage of formula (14.21) is the fact that all terms appearing in the sum 
are either nonnegative or nonpositive, regardless which particular probability distribution 
is considered, in contrast to the moments in Schiitzenberger's formula (13.21) . 

Proof. Assume first that the support of X is bounded. Starting from ([4,1]) . or rather (13.31) . 
we have 



K n — 

Ahn 



Ahn ' 

= (- 1 )"E(a} A!f[a1 ( t )^ 

Ahn ^ ' 

and by Chen's lemma this is equal to 

= ( _1 ) n Ehl A! E //"'/ F tT (t u ...,t n )dt 1 dt 2 ---dt n n x . 

Since F a only depends on the partition determined by a, we can collect equal terms to get 

= (-i) n Y,u} xlk ^ L -- k - l Yl JJ "J F n {t 1 ,...,t n )dt 1 dt 2 ---dt nf i x 

Ahn 7r~A „^.f w ^ w ^ 

— oo<ti<t2<---<t n <r 



7reIIn -00<t 1 <t 2 < --<t„<T 



Since the final formula does not depend on the chosen integration bound r, we may let it 
go to infinity and the formula holds for arbitrary distribution functions. □ 
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Remark 4.5. Using the recursive structure of the partition lattice, we can partially fac- 
torize the integrand of (I4.2p . Each partition tt G IT n can be constructed from a unique 
partition ir' G H n ~i by either adjoining {n} as a separate block or by joining n to one 
of the blocks of tt'. Assume that 7r' has k blocks. In the first case the number of blocks 
is increased to k + 1 and the Mobius function, which depends on the number of blocks, 
becomes 0,(71, l ra ) = —kfi(ir', l n _i). The integrand changes simply to F n (t) = F n >(t) F{t n ). 

In the second case the number of blocks remains the same and also the Mobius function 
and integrand stays the same. However there are k possible ways to join n to a block of 
7r'. Thus we can write 

K n (X) = (-l) n n\ // ••• J2 J2 (-kF„>(t 1 ,t 2 ,...,t n „ l )F(t n ) 

■J *J ** 1 1 _/ x-TT 



h<t 2 <--<t n 



k=i 7r'en n 

+ kF n , (tx,t 2 , • • • , tn-i) Mvr', ln-i) dti dt 2 --- dt r 



l) n n\JJ---J F 7r {t 1 ,t 2 ,...,t n - 1 ){l-F{t n ))ii{ir,i n -i)dtidt2---dt r , 



<l<t2<"<tr7 



The number of integration variables can still be reduced by two as discussed in [B]. 
For this purpose we introduce the so-called mean redidual life functions of Barlow and 
Proschan [1], namely 



[°°(1-F(t))dt 



P(y)=y-E(X\X<y) 



1 - F(y) 

iLnt)dt 



F(y) 

and obtain 

J M P (* 2 ) F -( t2 ' t2 ' • • • ' *»-0 (l-^(^n-l))i?(tn-l) yU(7T, l n _i) dt 2 dt 3 ■ ■ ■ dt n 

because F n (ti, t 2 , . . . , t„_i) always contains -F(ti) as a factor. 



«„(*) = (-l)*n! 



t2< 



5. Hoffding's formula and multivariate cumulants 

In this section we give another proof of Theorem 14.31 using an extension of Hoffding's 
formula due to Block and Fang [3|. Multivariate cumulants are defined as coefficients of 
multivariate Fourier transforms, namely 

Kn(X U X 2 , . . . , X n ) == ■ ■ ■ ^- logE e ^ 1 + -+z n X n 

The univariate cumulants in Section [2] correspond to the case X\ — X 2 — ■ ■ ■ — X n = X. 
As an example, the second cumulant is the covariance 

k 2 (X 1: X 2 ) = Cov(Xt,X 2 ) = E X\X 2 — E A x E A 2 . 

In general the multivariate analogue of (13.21) is 

(5.1) K n (Xi, X 2 , . . . , X n ) = ^ m n (X 1 ,X 2 , X n ) /i(7r, l n ) 

7ren n 
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where for a partition 7r e IT n we denote 

m 7r (X 1 ,X 2 ,...,X„)= n E II X - 

Ben ieB 

Hoffding's formula p2, [8] gives an alternative expression for the covariance in terms of 
distribution functions: 

k 2 {X x ,X 2 ) = Jj{F{t 1 ,t 2 )-F 1 (t 1 )F 2 {t 2 ))dt 1 dt 2 

where F(ti,t 2 ) = P(Xl < t\ A X 2 < t 2 ) is the joint distribution function and Fi and F 2 
are the marginal distribution functions of X\ and X 2 . From this it is easy to derive (11. ip . 

We shall use an extension of Hoffding's formula to cumulants of all orders, due to Block 
and Fang [3J from which Theorem 14.31 follows as a corollary. For the reader's convenience, 
we provide a short proof of Block and Fang's result here. For that purpose we introduce 
the following notations. 

Let Xi,X 2 ,...,X n be random variables and denote by F(t\, t 2 , . . . , t n ) = P(Xi < 
ti,X 2 < t 2 , . . . ,X n < t n ) their joint distribution function. More generally, for a subset 
I C {1,2, ... ,n} denote 

FjdU)^) = P(X { < ti : z e /). 
Define iterated integrals by the recursion 

F [0 '°'-'^(x 1 ,x 2 , ...,x n ) = F(xt,x 2 , ...,x n ) 
( ''' 2 ' F^-' ki+1 '-^( Xl ,x 2 ,...,x n ) = I"' F^-- k ^\x 1 ,x 2 ,...,t i ,...x n )dt i 



Then one can show by induction that 



(5.3) p\kiM,-,k n ] ( • • • , x n ) = E (Xi :f l)+1(X2 'f 2)+2 • • • 

hi k 2 \ k n \ 

Theorem 5.1 ([3]). For any n > 2 the n-th multivariate cumulant is given by 

oo 

K n {X u X 2 , ...,X n ) = (-1)" J J ■■■ J J2 F Ah,t 2 , • • • , tn) 1«) ^1 d *2 • • • dt n 

-oo ^e 11 " 
where for any partition it G n„ we denote 

F n (t u t 2 ,...,t n ) = Y[F B (ti : i <E B) 

Ben 

Proof. Assume first that the random variables Xi are bounded from above and choose 
upper bounds X{. Then the subscripts in (15.31) disappear and we have 

Ft 1 ' 1 "- 1 ] (xi, x 2 , . . . , x n ) = E(x! - X x ){x 2 - X 2 ) ■ ■ ■ (x n - X n ); 

because of translation semi-invariance we may use the modified moment-cumulant formula 

K n {Xi, X 2 , . . . , X n ) = (— l) n K n (xi — X\,x 2 — X 2 , . . . , x n — X n ) 



:-ir E 4 1 ' wl (^^2, 



5 Xn ) 

7ren„ 
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where fJ 1 ' 1 ''"' (xi,x 2 , ...,x n ) = YIbe-k '"' \ x i '■ * e B). Writing this out in terms of 
the recursion (15.2ft and noting that the result does not depend on the choice of the bounds 
Xi yields the claimed formula. □ 

Second Proof of Theorem \4-3[ If X x = X 2 = ■ ■ ■ = X n then the integrand 

F 7T (t 1 ,t 2 , ...,t n ) /i(7r, i n ) 

7ren„ 

is symmetric in t\, t 2 , ■ ■ ■ , t n , we may "shuffle" the integration variables and get 



K n {X^ X 2 , . . . , X n ) = {-l) n n\ J] "' J F*(ti,t2, ■ ■ ■ ,t n ) /i(vr, l n ) dt x dt 2 



dt n 



-oo<ti<* 3 <-<tn<oo 7rGl1 " 



and now observing that the joint distribution function of any subset satisfies 

Fj(ti :iel) = F(mm(ti : i G /)) 
we arrive at the claimed formula. □ 
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